Yuxin Peng

AesFormer: Transform Everyday Photos into Beautiful Memories

May 21, 2026
Via arXiv

Beyond Binary Success: A Diagnostic Meta-Evaluation Framework for Fine-Grained Manipulation

May 19, 2026
Via arXiv

FIKA-Bench: From Fine-grained Recognition to Fine-Grained Knowledge Acquisition

May 13, 2026
Via arXiv

BadmintonGRF: A Multimodal Dataset and Benchmark for Markerless Ground Reaction Force Estimation in Badminton

May 03, 2026
Via arXiv

OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding

Apr 28, 2026
Via arXiv

Taxonomy-Aware Representation Alignment for Hierarchical Visual Recognition with Large Multimodal Models

Feb 28, 2026
Via arXiv

Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping

Feb 27, 2026
Via arXiv

TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models

Feb 09, 2026
Via arXiv

Fine-R1: Make Multi-modal LLMs Excel in Fine-Grained Visual Recognition by Chain-of-Thought Reasoning

Feb 07, 2026
Via arXiv

Multi-Resolution Alignment for Voxel Sparsity in Camera-Based 3D Semantic Scene Completion

Feb 03, 2026
Via arXiv